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Abstract — In  recent  years,  both  the  scientific  community  and 
industry  are  focusing  on  moving  computational  resources  from 
the  centralized  Cloud,  with  remote  datacenters,  to  decentralized 
computing,  closer  to  the  source  or  the  so  called  “edge”  of  the 
network.  This  is  in  light  of  the  fact  that  the  Cloud  system 
alone  cannot  efficiently  support  the  future  network  demands  with 
massive  growing  number  of  time-critical  new  applications,  such 
as  self-driving  vehicles,  augmented  reality  (AR)/  virtual  reality 
(VR)  techniques,  advanced  robotics,  and  smart  with  remote 
critical  IoT  applications.  While  decentralized  edge  computing  will 
form  the  backbone  of  future  heterogeneous  networks,  it  is  still 
in  its  infancy  and  there  is  no  comprehensive  platform  to  date.  In 
this  article,  we  propose  a  novel  decentralized  edge  architecture, 
OMNIBUS  solution,  which  enables  continuous  distribution  of 
computational  capacity  for  end-devices  in  different  localities  by 
exploiting  moving  cars,  as  storage  and  computation  resources. 
Scalability  and  adaptability  are  the  main  features  differentiate 
the  proposed  solution  over  the  existing  edge  computing  models. 
The  proposed  solution  has  the  potential  to  scale  infinitely, 
which  will  lead  to  a  significant  increase  in  the  networks  speed. 
In  addition,  the  OMNIBUS  solution  rests  on  developing  two 
predictive  models:  (i)  to  learn  timing  and  direction  of  vehicular 
movements,  as  to  ascertain  computational  capacity  for  a  given 
locale  and  (ii)  to  introduce  a  theoretical  framework  for  sequential 
to  parallel  conversion  in  learning,  optimization  and  caching  under 
contingent  circumstances,  due  to  vehicles  being  in  motion. 


I.  Introduction 

IN  2015,  there  were  more  than  an  estimated  nine  billion 
connected  devices  around  the  world  including  smart  phones 
and  computers  [1];  Cisco  predicts  that  by  2020  50  billion 
“things”  will  be  connected  to  the  Internet  (Fig.  1)  [2].  Accord¬ 
ing  to  Cisco  forecasting  also,  by  2030  the  predictable  number 
of  connected  devices  to  the  Internet  will  reach  up  to  500 
billion  devices  [3].  The  centralized  cloud  system  alone  will  be 
insufficient  to  handle  the  future  networks  efficiently,  wherein 
is  an  environment  of  billion  of  devices  equipped  with  sensors, 
geared  to  collect  huge  amounts  of  data,  drawing  inferences  to 
carry  out  an  action.  To  transfer  massive  amounts  of  data,  from 
the  connected  devices  to  the  Cloud  to  be  analyzed,  creates 
very  crowded  traffic  on  the  network  infrastructure  [4,  5]. 
Moreover,  the  transfer  of  data  back  and  forth  between  the 
Cloud  and  individual  devices  increases  latency  while  many 
new  applications,  such  as  self-driving  vehicles,  remote  surgery, 
AR/VR,  8K  video,  advanced  robotics  in  manufacturing,  and 
drone  surveillance  communication,  require  real-time,  ultra-low 
delay  performance  [6,  4]. 

In  view  of  these  challenges,  data  center  operations  are 
being  pushed  to  the  “edge”  of  the  network.  The  edge  al¬ 
lows  for  certain  time-critical  and  security-sensitive  Artificial 
Intelligence  (AI)  applications  to  operate  either  entirely  on  a 
device,  or  in  conjunction  with  localized  datacenters.  None 
of  the  proposed  edge  architectures  [7,  8,  9,  10],  so  far,  are 
sufficient  to  handle  the  massive  data  traffic  computing  in 


the  future  networks.  Most  of  the  existing  solutions  focus 
on  installing  edge  devices  to  singular  static  locations  (e.g. 
factories,  shopping  malls)  or  around  specific  geographic  areas 
(urban  centers),  which  bear  the  cost  of  additional  infrastructure 
deployments  [11,  12,  13,  14].  There  is  also  a  growing  body 
of  research  on  exploiting  connectivity  among  end-devices,  in 
close  proximity  to  process  tasks  cooperatively  in  local  area 
computation  groups,  though  these  efforts  are  again  limited  in 
scope. 


Fig.  1.  Growth  of  the  connected  devices  (1950-2050) 


The  main  contribution  of  this  article  is  to  develop  a  novel 
mobile  edge  architecture  introduced  as  OMNIBUS  solution 
that  aims  to  advance  a  decentralized  computing  and  storage 
architecture  using  vehicles.  Road  vehicles  emerge  as  the  most 
promising  candidate  for  future  distributed  datacenters  on  the 
edge  of  the  network  for  two  primary  reasons:  (1)  Most  road 
vehicles  display  predictable  movement  patterns  (2)  hardware 
capabilities  for  storage  and  computing  in  cars  are  expected  to 
advance  tremendously  in  the  coming  years.  In  our  architecture, 
clusters  of  cars  form  a  powerful  local  hub  in  individual  areas 
capable  of  offering  high  computational  virtualized  resources 
for  end-devices. 

The  OMNIBUS  solution  will  address  two  interrelated  sci¬ 
entific  challenges:  First,  the  creation  of  a  mobility  prediction 
model  to  determine  the  flow  of  individual  cars  in  a  given 
area  at  a  certain  time.  With  this,  a  local  hub  can  be  created 
for  end-devices  in  that  area  by  using  cars  as  building  blocks. 
Efficient  algorithms  can  be  developed  to  ensure  computing  and 
storage  workloads  for  individual  areas,  as  cars  move  in  and 
out  of  a  given  area.  Second,  in  order  to  minimize  networking 
overheads  as  cars  move  in  and  out  of  a  region,  it  is  necessary  to 
study  required  distribution  of  computing  and  storage  resources 
among  cars.  The  main  scientific  contribution  of  the  OMNIBUS 
solution  will  be  to  initiate  a  theoretical  framework  for  se¬ 
quential  to  parallel  conversion  in  learning,  optimization,  and 
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Fig.  2.  Our  vision  for  the  future  of  edge  computing. 
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caching  algorithms  under  unreliable  circumstances  for  time- 
critical  performance.  Thus,  the  future  of  edge  computing  will 
be:  scalable,  ultra-low  latency  networking  for  the  time  critical 
applications  at  a  flat  cost  and  energy  efficient,  as  illustrated  in 
Fig-2. 

II.  Edge  Computing 

The  Cloud  has  been  an  important  solution  for  companies 
looking  to  scale  their  computational  operations  without  in¬ 
vesting  in  new  infrastructure  and  to  cut  down  on  operational 
costs  by  transferring  their  datacenters  to  cloud  providers. 
While  the  Cloud  catalyzed  growth  and  adoption  of  big  data, 
it  hide  the  costs  and  limitations  related  to  network  latency, 
security,  and  privacy.  As  a  result,  in  recent  years,  discussion 
on  computational  operations  increasingly  moved  from  the 
centralized  Cloud,  with  remote  datacenters,  to  decentralized 
computing,  closer  to  the  source  or  the  so  called  “edge”  of  the 
network. 

Edge  solutions  allow  information  processing  to  take  place 
at  the  device  or  gateway  level.  This  reduces  the  need  to 
transfer  data  back  and  forth  between  the  cloud  or  a  data 
center,  therefore,  decreasing  latency,  bandwidth  requirements, 
and  connectivity  dependencies.  Outside  of  technical  reasons, 
decentralized  computing  is  energy  saving  given  the  power  and 
cooling  costs  associated  with  big  datacenters.  As  importantly, 
research  on  edge  computing  is  driven  by  security  and  privacy 
concerns  related  to  the  centralized  Cloud  on  the  part  of 
states,  firms  and  consumers.  At  the  same  time,  falling  prices 
in  compute  and  storage,  together  with  the  rise  of  machine 
learning,  is  driving  the  adoption  of  edge  computing.  According 
to  IDC,  by  2019,  a  minimum  of  40  %  of  the  data  created  by 
IoT  “will  be  stored,  processed,  analyzed,  and  acted  upon  close 
to,  or  at  the  edge  of  the  network”  [15]. 

Systems  typically  known  as  edge  computing  include  Cyber 
Foraging  [16,  17],  Cloudlets  [18],  Fog  Computing,  and  mobile 
edge  computing  (recently,  the  name  replaced  by  Multi- Access 


Edge  Computing).  Multi- Access  Edge  Computing  was  initi¬ 
ated  by  the  European  Telecommunication  Standards  Institute 
(ETSI)  in  2014  with  a  focus  on  mobile  networks  and  Virtual 
Machine  technology  [19].  In  2017,  its  scope  was  expanded 
to  incorporate  non-mobile  network  requirements,  and  other 
virtualization  technologies.  The  concept  initially  envisioned 
providing  cloud-computing  capabilities  at  the  edge  of  mobile 
networks,  and  within  the  Radio  Access  Network  (RAN), 
by  deploying  mobile  edge  computing  servers  at  Long-Term 
Evolution  (LTE)/LTE-A  macro  base  stations  (eNodeB)  sites, 
3G  Radio  Network  Controller  (RNC)  sites,  and  at  multi-Radio 
Access  Technology  (RAT)  sites.  The  ETSI  initiative  has  also 
been  assigned  a  key  role  to  standardize  the  Application  Pro¬ 
gramming  Interfaces  (APIs)  between  the  mobile  edge  platform 
and  the  applications,  with  an  aim  to  foster  innovation  in  an 
open  environment. 

Fog  computing,  a  concept  introduced  by  Cisco  in  2012, 
is  an  extension  of  the  cloud  computing  paradigm,  from  the 
core  to  the  edge  of  the  network  [20].  Hence,  unlike  Mobile- 
Edge  computing  (MEC),  fog  is  strongly  linked  to  the  cloud, 
unable  to  operate  in  a  standalone  mode.  As  a  result,  there 
has  been  a  special  focus  on  communication  between  the  fog 
and  the  cloud  [21,  22,  23].  Moreover,  unlike  MEC,  which  is 
generally  deployed  at  a  base  station,  fog  nodes  can  be  placed 
anywhere  with  a  network  connection,  e.g.  factory  floor,  top  of 
a  power  pole,  a  railway  track,  a  vehicle,  etc.  [20].  Cisco  offers 
application  platforms  to  simplify  fog  application  development 
and  Cisco  Fog  Data  Services  for  data  analytics  [24]. 

In  parallel,  big  Internet  companies  are  rolling  out  edge 
infrastructure:  Facebook  is  building  micro  datacenters  for 
certain  types  of  applications  and  workloads.  Amazon  has 
launched  Amazon  Web  Services  (AWS)  Greengrass,  allowing 
for  developers  to  move  some  tasks  to  the  device  itself.  There 
are  also  companies  that  describe  themselves  as  edge  compa¬ 
nies,  including  EdgeConneX  and  vXchnge,  that  are  building 
networks  of  urban  datacenters.  For  instance,  a  startup,  Vapor 
IO  developed  micro-centers,  which  can  be  deployed  anywhere. 
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Fig.  3.  General  concept  of  the  proposed  OMNIBUS  architecture  solution  for  decentralized  storage  and  computing 
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Most  of  the  proposed  applications  bear  the  cost  of  additional 
infrastructure  deployment,  whether  it  is  installing  edge  devices 
to  singular  static  locations  (e.g.  factories,  shopping  malls)  or 
around  specific  geographic  areas  (urban  centers).  Furthermore, 
scalability  is  affected  negatively  with  the  massive  increase 
of  people  perform  transactions  within  a  given  specific  edge 
domain.  In  contrast,  with  the  solution  we  have  proposed, 
scalability  rises  with  adoption.  In  other  words,  as  a  huge 
number  of  users  require  computing  and  storage  transactions 
on  the  network,  we  expect  computation  to  become  much  faster 
allowing  for  the  rise  of  a  truly  global  network. 

III.  Edge  computing  with  end-devices 

There  is  a  growing  body  of  work  focusing  on  exploiting 
connectivity  among  end-devices,  in  particular  mobile  devices 
(mobile  cloud  computing),  in  close  proximity  to  process  tasks 
cooperatively  in  local  area  computation  groups  [25].  The 
end  devices  in  a  given  area  communicate  with  each  other 
to  find  resources  and  deliver  requests.  Hence,  the  end-user 
stratum  and  the  edge  stratum  are  merged.  In  the  literature, 
collaboration  is  a  central  feature.  The  authors  in  [26]  were 
proposed  a  vision  where  mobile  devices  form  “mobile  clouds”, 
or  mClouds  to  accomplish  tasks  locally.  In  [27]  a  “Transient 
Clouds”  was  proposed  as  a  collaborative  computing  platform, 
where  nearby  devices  form  an  ad-hoc  network  and  provide  var¬ 
ious  capabilities  as  cloud  services.  In  [28]  a  resource  sharing 
mechanism  was  proposed  to  utilize  capable  mobile  devices 
through  opportunistic  contacts  between  them.  Emphasizing 
resource  aspects  of  mobile  cloud  computing.  Other  work  has 
been  published  in  [29],  which  focused  on  virtual  machine 
technology  (VM)  to  harness  the  full  power  of  local  hardware 
at  the  edges  of  the  Internet,  while  [30]  proposing  an  adaptive 
method  of  resource  discovery  and  address  service  provisioning 
in  opportunistic  computing  environments  for  managing  higher 
load  requests  without  causing  instability  [31].  Sharing  some 
similarities  with  the  OMNIBUS  solution,  an  architecture  called 
Vehicular  Fog  Computing  (VFC)  for  vehicular  applications 
is  proposed  [32].  Although  this  preliminary  work  also  refers 


to  vehicles  (both  moving  and  parked)  as  an  infrastructure 
for  communication  and  computation,  it  does  so  to  service 
vehicles  alone  and  not  servicing  all  other  connected  devices 
and  applications.  In  all  proposals,  regarding  edge  computing 
that  merge  the  end-user  stratum  and  the  edge  stratum,  devices 
share  their  resources  among  each  other  in  a  limited  area. 
Thus,  our  challenge  is  to  bring  together  a  whole  range  of 
technologies  for  decentralized  computing. 

IV.  Omnibus  Architecture  and  Implementation 

The  OMNIBUS  proposal  expands  the  idea  of  end  user 
stratum  and  the  edge  stratum  to  the  next  level.  By  introducing 
a  predictive  platform  for  mobility  patterns  and  for  the  dis¬ 
tribution  of  storage  and  computation  capacity  among  cars,  it 
paves  the  way  for  an  efficient  and  highly  scalable  architecture 
for  device-level  edge  computing.  The  general  concept  and 
structure  of  the  proposed  OMNIBUS  architecture  solution  for 
decentralized  storage  and  computing  illustrated  in  Fig. 3  and 
Fig-4. 

Distributed  machine  learning  and  model  parallelism: 

Our  goal  is  to  speed  the  large-scale  Machine  Learning  (ML)  up 
by  reducing  training  time,  by  means  of  parallel  or  distributed 
computing.  Data  parallelism  and  model  parallelism  are  also 
methods  of  speeding.  Data  parallelism  partitions  the  data. 
In  model  parallelism,  our  solution  partitions  the  ML  model 
itself  to  distribute  the  workload  to  multiple  computational 
workers.  In  our  architecture,  it  is  necessary  to  understand 
ways  to  partition  the  ML  model  according  to  heterogeneity 
and  mobility  of  cars,  as  well  as  ,with  an  eye  to  ensure 
interoperability  on  the  level  of  different  service  providers. 
Given  the  high  number  of  machine  learning  models,  with 
each  model  having  its  own  characteristics  and  representations, 
there  is  no  principle  way  to  implement  model  parallelism. 
In  distributed  machine  learning,  the  synchronization  overhead 
increases  as  the  systems  scales. 

Our  solution  also  leverages  machine  learning  software  meth¬ 
ods  to  optimize  the  hyper-parameters  of  selected  algorithms. 
Furthermore,  it  utilizes  Hadoop  frameworks,  including  Hadoop 
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Fig.  4.  Structure  of  the  proposed  OMNIBUS  architecture  solution  for  decentralized  storage  and  computing 
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Distributed  File  System  (HDFS),  Spark,  and  Cassandra  for 
faster  and  energy  efficient  computation.  The  Hadoop  frame¬ 
work  employs  simple  programming  models  that  allow  for 
the  distributed  processing  of  large  data  sets  across  clusters 
of  computers.  Spark  is  a  compute  engine  for  Hadoop  data 
that  supports  an  entire  range  of  applications,  e.g.  machine 
learning,  stream  processing,  etc.  Cassandra  is  a  high  scalable 
database  with  no  single  point  of  failure,  which  makes  it  ideal 
for  mission  critical  data. 

Next  generation  distributed  ledger  technology:  For  stor¬ 
ing  data  and  enabling  fast  computation  in  the  network,  it  is 
very  important  to  study  directed  acyclic  graphs  (DAG)  based 
ledger  technology.  DAG  may  be  the  primary  data  structure 
for  us  to  create  a  peer-to-peer  network  protocol.  This  will  let 
us  advance,  in  distributed  machine  learning  methods,  to  add 
cognitive  capabilities,  as  well  as,  consensus  mechanisms. 

DAG  is  largely  more  suitable  for  our  solution,  due  to 
its  scalability  potential  and  lesser  processing  power  require¬ 
ments  compared  to  Bitcoin-like  blockchain  ledger  technologies 
[33,  34].  In  the  blockchain  system,  the  block  size  and  the 
time  required  to  generate  a  new  block  puts  limitations  on 
throughput  and  transaction  times.  In  contrast  to  blockchain 
technology,  DAG  transactions  are  not  grouped  into  blocks. 
Each  new  transaction  confirms  at  least  one  previous  transaction 
and  transactions  are  represented  as  “units.”  Hence,  selection 
of  a  branch  selection  and  detection  of  double-transaction  is 
decoupled  from  transaction  verification,  which  allows  nodes 
to  verify  transactions  in  parallel.  As  a  result,  DAG  has  the 
potential  to  achieve  unlimited  scalability. 

However,  as  DAG  based  solutions  emerge  for  high- 
frequency  transaction  scenarios,  problems  may  arise  in  low 
frequency  transaction  [33].  When  an  old  transaction  is  not  able 


to  receive  a  sufficient  number  of  new  transactions  to  verify, 
the  old  transaction  may  not  be  confirmed  in  time  or  not  be 
confirmed  at  all.  To  ensure  a  continuous  system,  our  solution 
optimizes  high  frequency  and  low  frequency  transactions  by 
harmonizing  DAG  and  Blockchain  concepts  as  required. 

Mobility  models:  Mobility  data  contains  the  approximate 
whereabouts  of  individuals  and  is  used  to  explore  the  move¬ 
ment  patterns  of  individuals  across  space  and  time.  The 
vehicular  mobility  maps  have  been  addressed  in  the  literature 
[35,  36,  37].  However,  they  are  not  comprehensive  enough. 
Moreover,  mobility  data  is  among  the  most  sensitive  data 
currently  being  collected.  While  the  discussion  on  individual 
privacy  with  respect  to  mobility  data  is  on  the  rise,  research 
in  this  area  is  still  limited  [38,  39,  40,  41].  The  OMNIBUS 
solution  is  proposed  to  design  a  targeted  mobility  model 
and  addressing  specific  tasks  that  do  not  compromise  an 
individual’s  privacy.  In  doing  this,  leveraging  machine  learning 
software  methods  and  distribute  ledger  technologies  is  very 
important. 

V.  Methodology 

(a)  Aggregated  Mobility  Handling  (AMH):  AHM  aims 
to  accurately  depict  vehicular  behavior  and  focuses  on  the 
following  principles:  (i)  Charting  out  mobility  patterns  of 
moving  cars,  in  order  to  optimize  the  computing  and  storage 
distribution  among  them.  Mobility  patterns  will  be  learned  in 
mixed  autonomy  with  each  car  sharing  the  mobility  patterns 
and  movements  of  the  other  cars,  (ii)  Aggregation,  which 
will  take  place  over  a  combination  of  DAG  and  Blockchain 
based  distributed  ledger  technologies,  depending  on  different 
frequency  scenarios,  (iii)  Leveraging  our  model  to  solve  the 
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massive  routing  problem,  to  bring  Internet  data  to  unconnected 
regions. 

(i)  Solving  the  Mobility  Handling  Problem:  Self-driving 
cars,  ride  sharing,  and  similar  exercises  in  mobility  as  a  service 
(MaaS)  are  turning  transportation  into  mixed  autonomy  sys¬ 
tems,  integrating  AI/ML  technology.  By  reducing  randomness, 
mixed  autonomy  systems,  including  autonomous  and  non- 
autonomous  vehicles,  make  it  possible  to  accurately  depict 
vehicular  behavior  (the  mobility  handling  problem)  [42,  43].  In 
this  relation,  mobility  pattern  challenges  and  requirements  of 
mixed  autonomy  systems  are  studied;  more  specifically,  a  con¬ 
vex  optimization  method  predicting  flow  is  used  to  represent 
coordination  of  automated  vehicles,  which  relies  on  accurate 
traffic  flow  sensing  [44,  45,  5].  MaaS  applications  enable  user 
induced  non-autonomy  systems  to  turn  a  generally  assumed 
to  be  intractable  problem  into  a  mixed-autonomy  problem. 
In  the  context  of  a  larger  dynamical  system,  this  dictates  the 
progression  of  the  integration  or  the  use  of  automation  [44]. 

Our  solution  is  to  first  generalize  the  mobility  handling 
problem,  using  generic  reinforcement  learning  techniques  for 
improved  dimension  reduction.  It  applies  machine  learning 
and  optimization  methods  to  mixed  autonomy  systems,  to 
address  the  automation  problems  of  integration  into  existing 
systems.  It  explores  empirical  and  theoretical  justifications  of 
edge/caching  systems  and  their  optimization  methods  as  a  de¬ 
sign  paradigm.  Through  principled  learning  and  optimization 
methods,  even  a  small  number  of  vehicles  can  be  harnessed 
for  significant  impact  on  the  Internet. 

At  this  point,  real-time  independent  decision  making  for 
the  random  behavior  of  car  passengers  &  drivers  is  a  crucial 
factor.  For  this  reason,  creating  a  sequential  decision-making 
tool/program,  to  model  the  learning  and  decision-making 
processes  of  car  passengers  and/or  drivers,  would  be  the  first 
to  be  acknowledged.  As  commuters  make  repeated  decisions, 
they  learn  over  time  to  optimize  their  route  choices.  This  can 
be  efficiently  modeled  by  a  sequential  process,  where  they 
optimize  a  payoff  function  at  each  step,  linked  to  the  results 
they  experience. 

Our  model  will  also  leverage  the  existing  literature  on 
traffic  systems  that  can  often  be  modeled  using  complex 
(nonlinear  and  coupled)  dynamical  systems.  In  addressing 
complex  traffic  control  problems,  it  should  be  developed  a 
decentralized,  learning-based  solution  involving  interactions 
of  humans,  automated  cars,  and  sensing  infrastructure,  using 
deep  reinforcement  learning.  The  resulting  control  laws  and 
emergent  behaviors  of  cars  will  potentially  provide  insight  for 
the  behavior  of  each  car.  These  insights  will  be  replicated, 
shared,  and  synchronized  among  cars,  over  a  distributed  ledger 
technology,  through  peer-to-peer  ad-hoc  networking  to  under¬ 
stand  the  potential  for  automation  of  flow.  We  have  already 
carried  out  a  simulation  for  a  two  lane  road  to  demonstrate 
the  possibility  of  computation  and  storage  period  [46]. 

Our  novel  computational  framework  that  integrates  open- 
source  deep  learning  and  simulation  tools  can  support  the 
development  of  edge  computing  in  vehicles,  in  the  context 
of  complex  nonlinear  dynamics  of  transportation.  Learned 
policies,  resulting  from  effectively  leveraging  the  structure  of 
human  driving  behavior,  surpass  the  performance  of  state-of- 


Fig.  5.  A  demonstration  of  machine  learning  software  programming  with 
Hadoop  Distributed  File  System  distributed  over  cars.  JVM  is  Java  Virtual 
Machine. 
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the-art  predictors  designed  for  various  mobile  applications, 
such  as  Google  Now.  The  framework  will  initially  focus  on 
highway  traffic,  and  later  will  include  arterial  traffic,  transit, 
as  well  as,  other  modes  of  transportation  /  commuting  (biking, 
MaaS,  carpooling,  etc.) 

(ii)  Distributed  Ledger  as  a  Database:  DAG  is  going  to 
be  used  in  distributed  ledger  technologies  for  storing  data  to 
enable  fast  and  scalable  computation  in  the  network.  DAG 
may  be  the  primary  data  structure  for  the  OMNIBUS  solution 
to  create  a  peer-to-peer  network  protocol.  However,  as  DAG 
based  solutions  emerge  for  high-frequency  traffic  scenarios, 
problems  may  arise  in  low  frequency.  To  ensure  a  continuous 
hybrid  system,  utilizing  sequential  Blockchain  verification  to 
parallel  DAG  verification  mechanism  is  necessary  to  accom¬ 
modate  increasing  and  unreliable  penetration. 

(iii)  Solving  the  Massive  Routing  Problem:  Mobility 
patterns  is  crucially  important  in  the  context  of  providing 
network  access  to  areas  without  Internet,  by  adding  a  spatial 
component  to  the  temporal  sequential  process,  which  may 
be  termed  as  “Cartesian”  machine  learning.  In  doing  so, 
understanding  the  “next  move”  to  be  utilized  as  the  “next 
hop”  in  routing  purposes  is  essential.  The  techniques  we 
develop  in  this  regard  leverage  known  models,  such  as  the 
replicator  dynamics,  mirror  descent,  stochastic  gradient  de¬ 
scent,  and  the  hedge  algorithm.  Overall,  it  is  necessary  to 
achieve  convergence  of  all  these  processes  towards  a  set  of 
equilibria  based  on  assumptions  made  on  the  learning  process 
used  by  humans  in  decision-making,  taking  into  consideration 
constraints  imposed  by  transportation. 

(b)  Decentralizing  computing  and  storage:  Anticipating 
the  demand  for  each  edge  car  and  deploying  adequate  car 
resources  are  very  important  to  sufficiently  meet  locational 
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Fig.  6.  A  Distributed  Machine  Learning  Example 


demands.  For  instance,  when  a  single  car  moves  out  of  the 
local  area,  its  storage  and  computing  resources  will  need  to 
be  distributed  across  the  cars  that  remain  in  that  area  and  new 
cars  that  enter  the  area.  Thus,  developing  predictive  algorithms 
is  very  necessary  to  optimally  distribute  computing  and  storage 
resources  among  cars,  taking  into  account  challenges  related  to 
redundancy,  security,  heterogeneity  of  devices,  and  federation 
(that  interoperability  is  ensured  on  the  level  of  different 
service  providers).  In  developing  these  algorithms,  creating 
and  employing  a  global  mobility  map  is  a  key  element. 
Leveraging  and  combining  existing  mesh  networking  systems 
for  car-to-car  car-to-device  communication  is  also  necessary. 
In  addition,  studying  how  to  distribute  computing  and  storage 
across  the  entire  system,  i.e.  whether  data  should  remain  local 
(shared  among  cars)  or  be  sent  to  the  cloud,  would  be  equally 
important. 

Another  important  task  would  be  building  parallel  systems 
that  harnessing  thousands  of  simple  and  efficient  computing 
and  storage  resources  would  be  a  practical  solution  to  sustain 
growth  without  scaling  technology.  To  this  end,  our  architec¬ 
ture  parallelizes  algorithms.  Tasks  need  to  be  implemented 
speculatively  and  in  an  out  of  order  manner.  Moreover,  thou¬ 
sands  of  tasks  need  to  be  speculated  efficiently  prior  to  the 
earliest  active  task  in  order  to  reveal  sufficient  parallelism.  To 
develop  parallel  algorithms  and  uncover  an  abundant  paral¬ 
lelism  in  large-scale  applications,  a  new  techniques  need  to 
develop  to  exploit  locality  and  nested  parallelism.  In  order  to 
generate  parallel  algorithms  in  cars,  it  should  be  focused  on 
the  following: 

(i)  Ensuring  consensus  among  multiple  cars  working  to¬ 
wards  a  common  goal,  for  instance,  when  all  cars  involved  are 
solving  one  optimization  problem  together,  yet  with  different 
partitions  of  the  dataset. 

(ii)  Redistribution  in  the  emergency  of  one  of  the  cars  being 
disabled  and  leaving  the  cluster.  The  issue  is  to  restore  the 
system  without  restarting  it. 

(iii)  Communication  and  Managing  resources.  Communica¬ 


tion:  computation  requires  a  lot  of  input/output  (I/O)  (e.g.  disk 
read  and  write)  and  data  processing  procedures.  OMNIBUS 
solution  distributes  storage  systems  to  enable  faster  I/O  and 
non-blocking  data  processing  procedures  for  different  types  of 
environments  (e.g.  single  node  local  disk,  distributed  file  sys¬ 
tems,  etc.).  Managing  resources:  the  issue  is  one  of  managing 
resources  within  a  given  cluster  of  cars  to  meet  all  demands 
while  maximizing  capacity. 

(iv)  Designing  a  programming  model  to  improve  efficiency. 
A  new  programming  model  is  employed  to  achieve  distributed 
computing  and  storage  algorithms,  in  the  same  way  as  non- 
distributed  ones,  which  requires  less  coding  and  improves 
efficiency.  Studying  programming  in  a  single-node  fashion, 
while  automatically  amplifying  the  program  with  distributed 
computing  techniques,  is  also  necessary.  Applying  model 
parallelism  partitioning,  to  the  ML  model  itself,  to  distribute 
the  workload  to  multiple  computational  cars  is  also  very  im¬ 
portant,  as  well  as  developing  a  unique  data  analytics  engine, 
specifically,  targeting  the  car  to  car  and  car  to  connected 
device,  for  big  data  processing. 

VI.  Challenges 

(a)  Decentralizing  computing  and  storage:  The  demand 
for  each  edge  car  needs  to  be  anticipated  so  that  adequate 
car  resources  are  deployed  to  meet  locational  demands.  For 
instance,  when  a  single  car  moves  out  of  the  local  area,  its 
storage  and  computing  resources  will  need  to  be  distributed 
across  other  cars  that  remain  in  that  area  and  new  cars 
that  enter  the  area.  Our  architecture  leverages  an  excessive 
mobility  map  in  developing  predictive  algorithms  to  optimally 
distribute  computing  and  storage  resources  among  cars.  In 
doing  this,  it  is  necessary  to  take  into  account  challenges 
related  to  redundancy,  security,  heterogeneity,  and  federation 
that  interoperability  are  ensured  on  the  level  of  different 
service  providers  and  mobility  handling.  OMNIBUS  model 
leverages  and  combines  existing  MANET  (Mobile  Ad-hoc 
Network),  VANET  (Vehicular  Ad-hoc  Network)  and  DTN 
(Delay  Tolerant  Networking)  technologies  for  car-to-car  and 
car-to-device  communication.  Moreover,  this  model  also  looks 
at  how  to  distribute  computing  and  storage  across  the  entire 
system,  i.e.  whether  data  should  remain  local  (shared  among 
cars)  or  be  sent  to  the  cloud. 

(b)  Aggregated  Mobility  Handling  (accurately  depict¬ 
ing  vehicular  behavior):  At  present,  there  is  no  large-scale 
mobility  map;  available  models  are  not  adaptable,  and  they 
do  not  adequately  address  privacy  concerns.  In  our  architec¬ 
ture,  developing  a  mobility  prediction  model  is  critical  for 
optimizing  the  allocation  of  computing  and  storage  resource 
sharing  among  them.  These  mobility  patterns  enable  us  to 
provide  for  offloading  decisions,  as  well  as,  to  control  en¬ 
ergy  consumption  and  bytes  of  data  transfer  [47,  48].  Our 
solution  will  use  databases  provided  by  mobile  operators, 
smart  transportation  systems,  etc.  to  build  our  mobility  model. 
Mobility  patterns  will  be  learned  and  each  car  shares  mobility 
patterns  and  movements  of  other  cars  via  mesh  networking 
technologies.  OMNIBUS  model  uses  a  combination  of  DAG 
based  and  Blockchain  based  distributed  ledger  technologies 
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depending  on  different  frequency  scenarios  for  aggregation. 
Our  problem  is  complex  since  it  focuses  on  continuously 
moving  cars  exchanging  data  with  each  other,  in  order  to  keep 
the  system  alive  in  any  given  location.  Proposed  solutions  to 
communication  regarding  moving  vehicles  are  so  far  limited  to 
highly  ordered  environments.  In  contrast,  OMNIBUS  solution 
seek  to  develop  communication  protocols  for  cars  in  a  highly 
chaotic  environment.  To  do  this,  OMNIBUS  model  leverages 
MANET,  VANET,  and  DTN  technologies. 

(c)  Heterogeneity  issues:  Heterogeneity  of  resources,  in 
terms  of  computational  and  storage  capabilities,  as  well  as, 
their  ad-hoc  availability,  is  necessary  to  optimize.  Hetero¬ 
geneity  is  important  in  deciding  which  application  component 
needs  to  be  deployed  and  where  it  should  it  deployed  [21]. 
This  involves  developing  algorithms  to  address  heterogeneity 
taking  into  account  the  limitations  of  specific  nodes.  For 
instance,  in  a  content  delivery  use  case,  storage  limitations 
of  the  caches  are  incorporated  into  the  caching  algorithm. 
Furthermore,  while  node  degrees  can  be  optimized,  each  cars 
CPU  will  need  to  compute  multiple  items  at  the  same  time. 
Ensuring  that  CPUs  are  not  overwhelmed  will  be  a  key 
consideration  in  developing  our  algorithms. 

(d)  Federation  issues:  In  our  architecture,  cars  are  geo¬ 
graphically  distributed  on  a  very  wide  scale  and  could  be 
assigned  to  different  service  providers.  Moreover,  the  cloud 
can  be  operated  by  a  different  provider.  Our  architecture  will 
be  designed  in  a  way  that  interoperability  is  ensured  on  the 
level  of  federation  of  different  service  providers.  This  means 
developing  a  consensus  protocol  to  understand  capabilities  of 
a  variety  of  cars  using  different  providers. 

(e)  Handling  mobility  of  end-users:  In  the  case  that 
end-users  physically  move,  the  system  should  be  able  to 
continuously  provide  them  with  the  same  quality  of  expe¬ 
rience,  without  interrupting  the  service.  Furthermore,  in  the 
scenario  that  several  end-users  are  watching  the  same  video, 
the  algorithm  may  need  to  allow  for  the  mobility  engine  to 
copy  the  video  to  be  pushed  to  the  destination  point.  Similarly, 
as  a  car  moves  in  our  system,  resource  displacement  takes 
place  with  implications  on  resource  management  algorithms. 
In  addressing  this  challenge,  studying  byzantine  fault  tolerance 
methods,  for  scenarios  where  a  car’s  data  center  may  fail 
or  move,  is  important  and  there  is  imperfect  information  on 
whether  a  component  has  failed  or  moved.  OMNIBUS  solution 
applies  model  parallelism  partitioning  the  ML  model  itself  to 
distribute  the  workload  to  multiple  computational  cars.  In  this 
architecture,  understanding  ways  to  partition  the  ML  model 
according  to  heterogeneity,  federation,  and  mobility  constantly 
be  investigated. 


VII.  Impact 

The  OMNIBUS  solution  will  have  a  far-reaching  impact  in 
three  areas  summarized  below:  (i)  5G  and  6G  Ultra-Reliable 
Low  Latency  Communication  (URLLC):  Ultra  low-latency, 
the  solution’s  key  objective,  is  to  enable  a  range  of  new 
applications  (Smart  driving,  Smart  Grids,  Augmented  Reality, 
and  IoT  in  general),  which  depend  on  ultra-reliable  and  ultra- 
low-latency  connectivity.  The  OMNIBUS  solution  is  driven  by 


Fig.  7.  A  Distributed  Storage  Example 
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the  need  to  remove  present  and  future  bottlenecks  in  commu¬ 
nication  networks  and  to  prepare  the  groundwork  for  future  5G 
and  6G  heterogenous  networks  [49,  50,  51,  52,  53,  54].  The 
solution  responds  to  the  market  need  for  a  comprehensive  edge 
network  platform  for  faster  and  more  reliable  data  processing. 
Attempts  to  move  computing  closer  to  the  network  (Cloudlets, 
Fog  Computing,  MEC)  are  not  scalable.  In  contrast,  our 
architecture  has  the  potential  to  scale  almost  infinitely  and 
increases  the  speed  of  networks  while  it  grows.  Our  ambitions 
go  further  and  our  research  paves  the  way  for  employing  all 
connected  devices,  including  persons  with  smart  phones  and 
all  IoT  applications,  as  computing  and  storage  centers.  The 
decentralized  network  architecture,  we  propose,  opens  up  new 
possibilities  for  network  slicing,  hence,  lower  latency,  more 
storage  capacity,  more  network  resilience  and  security,  and  less 
energy  waste.  By  breaking  down  and  distributing  computing 
and  storage  resources  for  intermittent  networking,  our  solution 
leads  the  way  for  a  scalable  collaborative  communication 
network. 

(ii)  Decentralized  Internet:  Our  framework  architecture 
can  be  used  for  high  latency,  delay  tolerant  Internet  access 
for  more  than  3.9  billion  of  the  world’s  population,  who  re¬ 
main  offline  today.  A  decentralized  storage  and  computational 
framework,  like  the  one  we  propose,  is  more  reliable  than 
current  digital  infrastructures,  which  are  vulnerable  to  disaster 
situations,  where  a  single  point  of  failure  in  the  infrastruc¬ 
ture  can  bring  down  the  entire  communication  network.The 
OMNIBUS  solution  leverages  our  mobility  model  to  solve 
the  massive  routing  problems  and  predictive  algorithms  we 
develop  to  optimally  distribute  computing  and  storage  among 
cars,  to  bring  Internet  data  to  unconnected  regions.  In  this 
regard,  the  OMNIBUS  solution  is  expected  to  open  new 
directions  in  research  on  ad  hoc  technologies  and  DTN-based 
data  mules.  As  opposed  to  URLLC,  this  can  be  named  as 
UCHLC:  Ultra  Coverage  High  Latency  Communication. 

(iii)  Smart  transportation:  Our  solution  will  have  consid¬ 
erable  impact  on  smart  transportation  systems,  including  traffic 
systems  and  edge  computing  in  vehicles.  It  has  the  potential 


to  redirect  research  on  traffic  systems  towards  a  decentralized, 
learning-based  study  of  complex  traffic  control  problems, 
involving  interactions  of  humans,  automated  vehicles,  and 
sensing  infrastructure.  The  resulting  control  laws  and  emergent 
behaviors  of  cars  will  potentially  provide  insight  for  the  behav¬ 
ior  of  each  car.  These  insights  will  be  replicated,  shared,  and 
synchronized  among  cars,  over  a  distributed  ledger  technology 
through  peer-to-peer  ad-hoc  networking,  to  understand  the 
potential  for  automation  of  flow. 

Furthermore,  our  research  can  be  employed  by  the  research 
community  as  a  new  computational  framework  that  integrates 
open-source  deep  learning  and  simulation  tools,  to  support  the 
development  of  edge  computing  in  vehicles,  in  the  context  of 
complex  nonlinear  dynamics  of  transportation. 

VIII.  Summary 

A  breakthrough  development  is  imminently  needed,  in  order 
to  support  the  demands  of  the  data  heavy  network  edge.  Even 
though,  there  are  various  proposed  solutions  regarding  Edge 
computing,  most  of  them  are  substantially  limited  and  not 
easily  scalable.  At  this  point,  the  OMNIBUS  solution  offers  a 
breakthrough  technology,  by  bringing  together  a  full  spectrum 
of  science  and  engineering  used  for  different  innovations  and 
has  the  potential  to  upend  the  ecosystem  for  future  network 
efforts.  This  solution,  especially,  paves  the  path  for  more  effi¬ 
cient  and  highly  scalable  device-level  edge  computing  archi¬ 
tectures.  It,  specifically,  develops  serious  key  objectives  to  en¬ 
able  a  range  of  new  applications  (Smart  driving,  Smart  Grids, 
Augmented  Reality,  and  IoT  in  general),  which  depend  on 
ultra-reliable  and  ultra-low-latency  connectivity.  OMNIBUS 
solution  has  the  potential  to  redirect  research  on  traffic  systems 
towards  a  decentralized,  learning-based  study  of  complex 
traffic  control  problems.  Furthermore,  it  can  be  employed  by 
the  research  community  as  a  new  computational  framework 
that  integrates  open-source  deep  learning  and  simulation  tools, 
to  support  the  development  of  edge  computing  in  vehicles,  in 
the  context  of  complex  nonlinear  dynamics  of  transportation. 
It  can  support  and  further  develop  state-of-the-art  predictors 
designed  for  various  mobile  applications,  such  as  Google  Now; 
it  will  also  make  existing  map  databases  more  accurate  and 
more  interactive.  The  OMNIBUS  solution  can  be  the  most 
important  and  influential  building  block  for  future  network 
efforts. 
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